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[00011 The present invention reiates to audio user Interfaces In which services are represented by audio labels In 
an audio field. 

Background of the Invention 

r00021 The human auditory system, including related brain functions, is capable of localizing sounds in three dimen^ 

TO ani iS cues are complementary inasmuch as the former works better at low frequences and the latter better at 
SlSS^he primary cue for elevation depends on the ac^ 

Strength relative to the other sound frequencies being received. 
Range Cues - These Include: 

. loudness (the nearer the source, the louder it will be; however, to be useful, something must be known or assumed 
about the source characteristics), . 
motl Daillax (change in source azimuth in response to head movement .s range dependent), and 

- S2d aSuons I. nol.ar.rghl-l.~ara .«k ar,a generally requires an und.rsian*ng ol the tereg»ng '^■^^ 

For sound sources that have a fixed presentation (non-interactive), it is possible to produce convincing 3D 

rot" ?ornrbi:rsrunirr:eTbrva4^^^^^^^^^ .n a 3D audio «e«, a number of systems have evo.ed 
Related Impulse Response (HRIR). If the ^^"J; j^"™"^^^^ , ^a^ous source locations, the location of a 
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and used to try to cancel out cross-talk from the left speaker to the right ear and from the right speaker to the left ear. 
[0009] Other approaches to those outlined above for the generation of 3D audio fields are also possible as will be 
appreciated by persons skilled In the art. Regardless of the method of generation of the audio field, most 3D audio 
systems are, in practice, generally effective in achieving azimuth positioning but less effective for elevation and range. 

5 However, in many applications this is not a particular problem since azimuth positioning is normally the most important. 
As a result, systems for the generation of audio fields giving the perception of physically separated sound sources 
range from full 3D systems, through two dimensional systems (giving, for example, azimuth and elevation position 
variation), to one-dimensional systems typically giving only azimuth position variation (such as a standard stereo sound 
system). Clearly, 2D and particulahy 1 D systems are technically less complex than 3D systems as illustrated by the 

to fact that stereo sound systems have been around for very many years. 

[001 0] In terms of user experience, headphone-based systems are inherently "head stabilized" - that is, the generated 
audio field rotates with the head and thus the position of each sound source appears stable with respect to the user's 
head. In contrast, loudspeaker-based systems are Inherently "world stabilized" with the generated audio field remaining 
fixed as the user rotates their head, each sound source appearing to keep its absolute position when the hearer's head 

'5 is turned. In fact, it is possible to make headphone-based systems "world stabilized" or loudspeaker-based systems 
"head stabilized" by using head-tracker apparatus to sense head rotation relative to a fixed frame of reference and 
feed corresponding signals to the audio field generation system, these signals being used to modify the sound source 
positions to achieve the desired effect. A third type of stabilization is also sometimes used in which the audio field 
rotates with the user's body rather than with their head so that a user can vary the perceived positions of the sound 

20 sources by rotating their head; such "body stabilized" systems can be achieved, for example, by using a loudspeaker- 
based system with small loudspeakers mounted on the user's upper body or by a headphone -based system used in 
conjunction with head tracker apparatus sensing head rotation relative to the user's body. 

[001 1 ] As regards the purpose of the generated audio field, this is frequently used to provide a complete user expe- 
rience either alone or in conjunction with other artificially-generated sensory inputs. For example, the audio field may 
25 be associated with a computer game or other artificial environment of varying degree of user Immersion (including total 
sensory immersion). As another example, the audio field may be generated by an audio browser operative to represent 
page structure by spatial location. 

[0012] Alternatively, the audio field may be used to supplement a user's real world experience by providing sound 
cues and information relevant to the user's current real-world situation. In this context, the audio field is providing a 
30 level of "augmented reality". 

[0013] It is an object of the present invention to provide an audio user interface facilitating selection of resources. 

Summary of the Invention 

35 [0014] According to one aspect of the present invention, there is provided an audio user-interfacing method in which 
services are represented by audio labels presented in an audio field through respective synthesized sound sources, 
the method comprising the steps of: 

(a) storing, for each service to be represented, service access data and data associating the service with at least . 
40 one said sound source and specifying at least one audio label; 

(b) generating an audio field in whk:h said sound sources are synthesized at respective rendering positions to 
sound their associated service-representing audio labels; 

(c) selecting a represented service by identifying it through at least one of its sound source and audio label; 

45 the method further involving enabling a user to modify the audio-field layout of the service-representing sound sources 
and/or what services are represented in the audio field. 

[001 5] According to another aspect of the present invention, there is provided apparatus for providing an audio user 
interface in which services are represented by audio labels presented in an audio field through respective synthesized 
sound sources, the apparatus comprising: 

50 

a memory for storing, for each service to be represented, service access data and data associating the service 
with at least one said sound source and specifying at least one audio label; 

a generation subsystem for generating, through audio output devices, an audio field in which said sound sources 
are synthesized at respective rendering positions to provide sounds their associated service-representing audio 
S5 labels; 

a selection arrangement for selecting a represented service by identifying it through at least one of its sound source 
and audio label; and 

: user input means for enabling a user to modify the audio-field layout of the service-representing sound sources 
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and/or what services are represented in the audio field.. 
Brief Description of the Drawings 

100161 Embodlmems o, the invention wii. now be .escribed, by way o, non-i.miting example, with reterence to the 
accompanying diagrammatic drawings, In which. 

Pi«..r« 1 is a functional block diagram of a first audio-field generating apparatus 

■ ITZ I s a diaaram Illustrating a general cylindrical organization of an audio f.eld; 

■ F au c 7 is a diagram us rating a L specL form of the Figure 6 cylindrical organization: 

Figure 12 is a further diagram Illustrating the operation of the focus expanaer, in 

different direction to the audio field reference vector; apparatus; 

• F gu " 1 0 Ippa^^usTrtiting'the search space of a speech recognizer of the -PP— 

C;S"-LTJc°K^T:?a,^^^^^^^^^^^ 

R^-S is a diagram showing a trackball input device similar to Figure 20 but including a second form of visual 

FSut24tar;;m^^^^^^^ 
■ where the apparatus Is arranged to produce a cylindrical audio fieW; and 

Best Mode of Carrying Out the Invention 

10017, The fom,s of apparatus to be described below are °P-f;^°J^^^^^^ 

nterface to services such as communication services (for ^^^'^f^'^^'-^^^^^^l' '^^^^ Engines and individual 
ment services (such as internet radio), -'^-^^-Vr^d banana wtb sS a^Lm^^^^^ etc. 
documents), transactional services (for example, retail and bank.ng ^ fj'^^);^^^^^^^ , « eoriB- 

[0018] When the apparatus Is in a "desktop" "^^^'f^^^^^^^^^^^^^^^ The audio label associated 

spending synthesized sound source presenting «"J"^'°jf j^l^^^^^^^ that seo/ice - for example, 

with a service can be constituted by any or even a low-level 

an audio label can be the se-vice name, a short verbal ^^^^^"f^a t^e se^^^ synthesized to sound, to a user, 
audio feed from the service ttsetf. The sound sources ^ f ';®^^^^'^^^.^7_Xlisation method; these sound 
as though they exist at respective locations in the -<;o J^e^ ru^XT-e'fu^S^^^ are Involved In the 
sources do not individually exist as physical sound output "^^^'^f J^3„°' ^^^^ ^ real-world existence to the 
process of syrrthesizing the sound -"--/"r;^^^^^^^^^^^^ the concept of sound 
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in terms of such sound sources spatialized to specific locations in the audio field. 

[0019] Upon a service presented through a sound source being selected (in a manner to be described hereinafter), 
the apparatus changes from the desktop mode to a service mode in which only the selected service is output, a full 
service audio feed now being presented in whatever sound spatialisation Is appropriate for the service. When a user 

5 has finished using the selected service, the user can switch back to the desktop mode. 

[0020] It will be appreciated that other possibilities exist as to how the services are presented and accessed - for 
example, the feed from a selected service can be output simultaneously with background presentation of audio labels 
for the other available services. Furthermore, a service can provide its data in any form capable of being converted in 
audible fomi; for example, a service may provide its audio label in text fomn for conversion by a text-to-speech converter 

10 into audio signals, and its full service feed as digitised audio waveform signals. 

[0021] It is also possible in the desktop mode to use more than one sound source to represent a particular service 
and/or to associate more than one audio label with each sound source as will be seen hereinafter. 

Audio Field Organisation - Spherical Field Example 

15 

[0022] Considering now the first apparatus (Figure 1), in the form of the apparatus primarily to be described below, 
the audio field is a 2D audio field configured as the surface of a sphere (or part of a sphere). Such a spherical-surface 
audio field is depicted in Figure 2 where a spatialised sound source 40 (that is, a service audio label that has been 
generated so as to appear to come from a particular location in the audio field) is represented as a hexagon positioned 

20 on the surface of a sphere 41 (illustrated in dashed outline). It may be noted that although such a spherical surface 
exists in three-dimensional space, the audio field is considered to be a 2 dimensional field because the position of 
spatialised sound sources in the audio field, such as source 40, can be specified by two orthogonal measures; in the 
present case these measures are an azimuth angle X** and an elevation angle Y^. The azimuth angle is measured 
relative to an audio-field reference vector 42 that lies in a horizontal plane 43 and extends from the centre of sphere 

25 41 . The elevation angle Is the angle between the horizontal and the line joining the centre of the sphere and the sound 
source 40. 

[0023] In fact, the Figure 1 apparatus is readily adapted to generate a 3D audio field with the third dimension being 
a range measure Z, also depicted in Figure 2, that is the distance from the centre of sphere 41 to the spatialised sound 
source 40. Conversely, the Figure 1 apparatus can be adapted to generate a ID audio field by doing away with the 

30 elevation dimension of the spatialised sound sources. 

[0024] The Figure 1 apparatus supports azimuth rotation of the audio field, this potentially being required for imple- 
menting a particular stabilization (that is, for example, head, body, vehicle or world stabilization) of the audio field as 
well as providing a way for the user to explore the audio field by commanding a particular rotation of the audio field. 
As is illustrated in Figure 3, the azimuth rotation of the field can be expressed in terms of the angle R between the 

35 audio-field reference vector 42 and a presentation reference vector 44. This presentation reference vector corresponds 
to the straight-ahead centreline direction for the configuration of audio output devices 11 being used. Thus, for a pair 
of fixed, spaced loudspeakers, the presentation reference vector 44 Is the line of equidistance from both speakers and 
is therefore Itself fixed relative to the world; for a set of headphones, the presentation reference vector 44 Is the forward 
facing direction of the user and therefore changes its direction as the user turns their head. When the field rotation 

40 angle R=b°, the audio-field reference vector 42 is aligned with the presentation reference vector 44. The user is at 
least notionally located at the origin of the presentation reference vector. 

[0025] The actual position at which a service-representing sound source Is to be rendered In the audio output field 
(Its "rendering position") by the Figure 1 apparatus, must be derived relative to the presentation reference vector since 
this Is the reference used by the spatialisation processor 10 of the apparatus. The rendering position of a sound source 

45 is a combination of the Intended position of the source in the audio field judged relative to the audio-field reference 
vector, and the current rotation of the audio field reference vector relative to the presentation reference vector. 
[0026] As already Intimated, apart from any specific azimuth rotation of the audio field deliberately set by the user, 
the audio field may need to be rotated in azimuth to provide a particular audio-field stabilisation. Whether this is required 
depends on the selected audio-field stabilization and the form of audio output devices. Thus, by way of example, unless 

50 otherwise stated, it will be assumed below that the audio output devices 1 1 of Figure 1 apparatus are headphones and 
the audio field is to be body-stabilised so that the orientation of the audio field relative to the user's body Is unaltered 
when the user turns their head -this Is achieved by rotation of the audio field relative to the presentation reference 
vector for which purpose a suitable head-tracker sensor 33 is provided to measure the azimuth rotation of the user's 
head relative to its straight-ahead position (that is, relative to the user's body). As the user turns their head, the angle 

ss measured by sensor 33 is used to rotate the audio field by the same amount but in the opposite direction thereby 
stabilising the rendering positions of the sound sources relative to the user's body. 

[0027] It will be appreciated that had it been decided to head-stabilise the field, then for audio output devices in the 
form of headphones, it would have been unnecessary to modify the orientation of the audio field as the user turned 
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20 



audio output devices 11 '^^f" '"-"^J'^ltt ^^^^^^ orientation o, the audio tieid .ust be 

headphones are to be used and the audio field Is to wono s ,ho worid whether caused by the user turning their 
modified by any change In orientation of the user's head ^«J°«^^,^°^J^73°,'^^^^^^^ electronic compass. Similar^, 
headorbybodymovementsiasuitabiehead-trackercanbepr^^^^^^^ 

,f the audio output devices 11 ^^^'^''tr nS^aTv chanqe ^^^^^^^^^^^^ the vehicle as detem^lned by any 
the orientation of the audio field must be mod tied f "/f^^^^e m orientm ^^^^^ ^ ^^^^ 

suitable sensor. It may be a^^^'^l-y rpro3 g world stabilisation (whether the audio 
rotation relative to the world now being done with respect t^^^^'S^^ only concerned azimuth rotation - that is, 

in elevation in this way will be apparent to persons skilled art_ j,^., 
[0029] considering Figure 1 In more <l-f '^^f --^^^^ :r:2essibre vi^a c^^^^^^ 'ink 20 (such as a radio 

(for example, an application running on a local processor) "^^'^f^tlj'^'l^ ^^^^ conveniently be categorised into 
nk or fixed wire connection providing internet or '"f ^^^/^^ J^^^.^^^^^^^ ,° inity (augmentation services), 
general services such as e-mail, and --■>'''«f Jf^^^"^,^^^ "^^^^^^^^^^^ criteria and possibly 

The sen/ices are selected by selection control input unit or interactive display, 

also by real-time user input provided v,a any suitable 
10030? Amemo,y14isused,ostoredataaboutthese^«nedser>.c^s^ 

service ID. For each selected service, memory ^ ^^^^J^^^^^^^ be used to representthe service with 
URL) and data on the or each sound source ^P«'^''f^^7^*'J„f;,'^ ^^.^J,"^^^^^^^^ each sound source, the memory 
each such sound source being d'stingu^hed by a 3 further suffix to the suffixed service 

holds data on the or each associated audio "^^^^f^^^ '.f;' '"^^^^^^^ are either provided by the services 

,D used to identify the sound source. The «f '° '^J^«^J°^ '^^^ Scu'ar id^nWied services. The labels are preferably 
themselves to the subsystem 13 or are specrf.ed ''^ "["^f "Jf '^^^^^ (not shown) as and when required 

providedandstoredintext-fomriforconversiontoaud^byate^^^^^^^ 

by the spatiallsation processor. Where the audio 'J^f .^'^^l^J^^^ replacing the normal audio label of a 

is chosen by the user by input to output selection block 12^ ^^^^ t^^^^ 

[0032] Rather than the services to be represented in the ™ and the related sound-source 

currently found to be available, a set of services to and access data, 

data (including audio labels) forthese 'y^^^^?^;",^"^^^^^ set of seo^lces are repre- 

40 In this case, when the apparatus is in its desktop -""^f,- "'^^^^'^^^ '^J^^ ^^^^ contact the services concerned; 

rr.— 

s in its desktop mode, each servtoe may P!°^'^l^°^'''°ZZZcel tfbe rep eSed. or giving a real-world location 
in the audio field for the sound source(s) ^^^'l^^^^^"^^ o, an LgSeTrealrty servte^ associated with a 

ro^rrnrSty^ss.^^^^^^^ 

in memory 14 along with the audio labels ''^'^l^^;^ .^i;^^^ ,0 determine its final rendering position in the 
10034] For each service-representing sound 'i;^f "f/Jne J a sound-source data item into a 

output audio field taking account of a number of factors. This « done by '"le^'ng ^ 
prJSessing path involving elements 21 t° 30 Jh^ sound^^^^^^^ 

related suffixed service ID) for the sound source concerned ^^V^J^^f ^"PP^, ^ subsystem 13 passes each 
source, and possibly also the se^ice type (g^!^;.^^'^^;^,^"^^^^^^ of the sound source is decided 



25 



30 



45 

in 



50 



6 



EP 1 227 392 A2 



1 D, 2D, or 3D) of the audio field. The decided position for each source is then temporarily stored in memory 25 against 
the source ID. 

[0035] Provision of a user input device for modifying the position of each sound source relative to the audio field 
reference, enables the user to modify the layout of the service-representing sound sources (thai is, the dispositions of 

5 these sound sources relative to each other) as desired. 

[0036] With respect to a service having an associated real-world location (typically, an augmented reality service), 
whilst it is possible to position the corresponding sound source in the audio field independently of the relationship 
between the associated real-world location of the service and the location of the user, it will often be desired to place 
the sound source in the field at a position determined by the associated real-world location and, in particular, In a 

10 position such that it lies In the same direction relative to the user as the associated real-world location. In this latter 
case, the audio field will generally be world-stabilised to maintain the directional validity of the sound source in the 
audio field presented to the user; for the same reason, user-commanded rotation of the audio field should be avoided 
or inhibited. Positioning a sound source according to an associated real-world location is achieved In the present 
apparatus by a real-world location processing functional block 21 that forms part of the source-position set/modify 

15 block 23. The real-world location processing functional block 21 is arranged to receive and store real-world locations 
passed to it from subsystem 13, these locations being stored against the corresponding source IDs. Block 21 is also 
supplied on input 22 with the current location of the user determined by any suitable means such as a GPS system 
carried by the user, or nearby location beacons (such as may be provided at point-of-sale locations). The block 21 first 
detennines whether the real-world location associated with a servtee is close enough to the user to qualify the corre- 

20 spending sound source for inclusion in the audio field; if this test is passed, the azimuth and elevation coordinates of 
the sound source are set to place the sound source in the audio field in a direction as perceived by the user corre- 
sponding to the direction of the real world location from the user. This requires knowledge of the real-world direction 
of -pointing of the un-rotated audio-field reference vector 42 (which, as noted above, is also the direction of pointing of 
the presentation reference vector). This can be derived for example, by providing a small electronic compass on a 

25 structure carrying the audio output devices 11. since this enables the real-world direction of pointing of presentation 
reference vector 44 to be measured; by noting the rotation angle of the audio-field reference vector 42 at the moment 
the real-world direction of pointing of vector 44 is measured, It is then possible to derive the real-world direction of 
pointing of the audio-field reference vector 42 (assuming that the audio field is being wortd-stabilised). It may be noted 
that not only will there nonmally be a structure carrying the audio output devices 11 when these are constituted by 

30 headphones, but this is also the case in any mobile situation (for example, in a vehicle) where loudspeakers are in- 
volved. 

[0037] If the audio field is a 3D field, then as well as setting the azimuth and elevation coordinates of the sound 
source to position it in the same direction as the associated real-world location, block 21 also sets a range coordinate 
value to represent the real world distance between the user and the real-world location associated with the sound 
35 source. 

[0038] Of course, as the user moves in space, the block 21 must reprocess its stored real-world location information 
to update the position of the corresponding sound sources in the audio field. Similarly, if updated real-world location 
information is received from a service, then the positioning of the sound source in the audio field must also be updated. 
[0039] Returning to a general consideration of the Figure 1 apparatus, an audio-field orientation modify block 26 is 

40 used to specify any required changes in orientation (angular offset) of the audio-field reference vector relative to pres- 
entation reference vector. In the present example where the audio field is to be body-stabilized and the output audio 
devices are headphones, the apparatus includes the afore-mentioned head tracker sensor 33 and this sensor is ar- 
ranged to provide a measure of the turning of a user's head relative to their body to a first input 27 of the block 26. This 
measure is combined with any user-commanded field rotation supplied to a second input of block 26 in order to derive 

45 a field orientation angle that is stored in memory 29. 

[0040] As already noted, where headphones are used and the audio field is to be world stabilised (for example, 
where augmented-reality service sound sources are to be maintained in positions in the field consistent with their real 
world positions relative to the user), then the head-tracker sensor needs to detect any change in orientation of the 
user's head relative to the real world so that the audio field can be given a counter rotation. Where the user is travelling 

50 in a vehtele and the audio field is to be vehicle-stabilised, the rotation of the user's head is measured relative to the 
vehicle (the user's "local" world, as already noted). 

[0041] Each source position stored in memory 25 is combined by combiner 30 with the field orientation (rotation) 
angle stored in memory 29 to derive a rendering position for the sound source, this rendering position being stored, 
along with the source ID, in memory 16. The combiner operates continuously and cyclically to refresh the rendering 
55 positions in memory 15. 

[0042] Output selection block 1 2 sets the current apparatus mode according to user input, the available modes being 
a desktop mode and a service mode as already discussed above. When the desktop mode is set, the spatlalisation 
processor 10 accesses the rendering position memory 15 and the memory 14 holding the service audio labels to 
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10 



rendering position. .„,iofla..riio feed (or the chosen service Is rendered by the spatialisallon 

100431 When the sen/Ice mode is set, thef ul f '^'^^.^"^'°'f„^J^!°'*:^ ,t will be appreciated that, although 

Uoeisor10accordingtowhateverposltloninfom.at»ni^^^^^^^^^ 

not depicted, this service position ,nforn.at,on ^f^f ^^^^ "J^^^^^^^^^^^^^ ,he service audio labels; however, this 

!eed r^ay be provided as P--PatiaHzed -^^^^^^^^ body Sd lul ^e,d through headphones, the user car, 
10045] With the Figure 1 apparatus set »<> prov de a Jody staj ^^^.^ p.,g,,e 4 illustrates a 

explore the audio field in two ways, namely by tu ning J^f^" ^ a spherical surlace. In this case, 

us'er turning their head to explore a 2D aud. '^-^JJ^'^^^^^^^^^^^ is poLned in the audio field at an 

six spatialised sound sources ^0 are dep.cted. ^JJ^f reference vector 42. The user has not cor^- 

azimuth angle of X,° and eleva^on angle Y, ^^'^'^l™ angle X2° towards 

manded any explicit rotation of the audio ''^ "^^^"^.^^^'■/^^^^^^^^ ,he audio-field reference vector 42 has been 

the source 40A. °rder to maintain body-stab,l sat,ono^^^^^^^ 

rrnrrsSrhSd^^^^^^^ 

vector is therefore: 



Azimuth = X^** -X2 



30 Elevation = Yi** 



this being the pos.ion output by combiner 30 and stored in memo. 15. The resu. Is that turning o, the use.s head 

does indeed have the effect of turning t^^^^-^.^f ^"""'^^^^^'^^d in Figure 4, how the user can bring the sound 
35 [0046] Figure 5 illustrates, for the same audio field J^^^^^^";^';" /x of the audio field by user input 

source 40A to a position directly ahead of the user by f "9 ^ muth rendering position of the sound souk« 

28 to block 26 (effected, for example by a ^"^^^V ^^^^^ reference vector 

40 faces ahead again. 

A..Hi^ ci»iH nr ^anlsation - ^"nririn»l Field Example 

[0047] The Figure 1 apparatus can be ^^^^^ ^t:eT3e~aV^^^^^^^^^^^^^^ 
the surface of a vertically-orientated cylinder .f^^'^^^,,,' ,L the spherical audio field previously de- 

conforms to a notional cylindrical surface 50. Th^^'=y ' ^^^^^^ of a sound sourx:e 40 in the field it can 

scribed with reference to Figure 2, is two d,mens«r^al '"^^^^^^^J'^^P^i^J^..^ (height) distance Y, both measured 
be specified by two coordinates, namely an a^'"""* «",X,fj,^^"^^^^^^^^ 3D audio field can be specmed by 

relative to an horizontal audio-field reference vector ^-^l^^^^^^^'^f^^^^^^^^ audio field. As with the spherical 
adding a range coordinate Z, this being the ^^^s J"- ^^J^ ang^ter J offset by angle R') relative to a pres- 
audlo field described above, the cylindrical audio field "^ay be rotated (ar^ y ^^^j^^^ ^ particular 

entation reference vector 54. this being '^--'^'^^^^^^^^^ field. In addmon. the audio 

;;:srrr;rrdrorn7e^^^^^^^^^^^^ 
rrr^Sv^grr^^^^^^^^ 
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is a zone of the cylindrical audio field bounded by upper and lower elevation values determined by a currently com- 
manded height H so as to keep the focus zone fixed relative to the assumed user position (the origin of the presentation 
reference vector); within the focus zone, the sound sources 40 are active, whilst outside the zone the sources 40 are 
muted (depicted by dashing of the hexagon outline of these sources in Figure 6) except for a limited audio leakage 56. 
5 in Figure 6, the focus zone (which Is hatched) extends by an amount C above and below the commanded height H 
(and thus has upper and lower elevation values of (H + C) and (H - C) respectively. In the Illustrated example, H=0 and 
C Is a constant; C need not be constant and it would be possible, for example, to make Its value dependent on the 
value of the commanded height H. 

[0049] The general form of cylindrical audio field shown in Figure 6 can be implemented in a variety of ways with 

10 respect to how leakage into the focus zone Is effected and how a user moves up and down the cylindrical field (that 
is, changes the commanded height and thus the current focus zone). Figures 7 and 8 illustrate two possible Implemen- 
tations in the case where the audio field is of semi-cylindrical form (azimuth range from +90" to -90°). 
[0050] In Figure 7, leakage takes the fonn of the low-volume presence of sound sources 40W in upper and lower 
"whisper" zones 56, 57 positioned adjacent the focus zone 55. Also, the commanded height value is continuously 

15 variable (as opposed to being variable in steps) . The result is that the user can effectively slide up and down the cylinder 
and hear both the sound sources 40 In the focus zone and, at a lower volume, sound sources 40W in the whisper zones. 
[0051] In Figure 8, the service sound sources are organised to lie at a number of discrete heights, in this case, four 
possible heights effectively corresponding to four "floors" here labelled "1" to "4". Preferably, each "floor" contains 
sound sources associated with services ail of the same type with different floors being associated with different service 

20 types. The user can only command step changes in height corresponding to moving from floor to floor (the extent of 
the focus zone encompassing one floor). Leakage takes the form of an upper and lower advisory sound source 60, 61 
respectively positioned just above and just below the focus zone at an azimuth angle of 0°. Each of these advisory 
sound sources 60, 61 provides a summary of the services (for example, in temns of service types) available respectively 
above and below the current focus zone. This pennits a user to determine whether they need to go up or down to find 

25 a desired service. 

[0052] It will be appreciated that the forms of leakage used In Figures 7 and 8 can be interchanged or combined and 
that the Figure 8 embodiment can provide for sound sources 40 on the same floor to reside at different heights on that 
floor. It Is also possible to provide each floor of the Figure 8 embodiment with a characteristic audio theme which rather 
than being associated with a particular source (which is, of course, possible) is arranged to surround the user with no 

30 directionality; by way of example, a floor containing museum services could have a classical music theme. 

[0053] In arranging for the Figure 1 apparatus to implement a cylindrical audio field such as depicted in any of Figures 
4-6, the positions set for the sound sources by block 23 are specified in terms of the described cylindrical coordinate 
system and are chosen to conform to a cylindrical or part-cylindrical organisation in 1 , 2, or 3D as required. The ori- 
entation and vertical positioning of the audio field reference vector 42 are set by block 26, also in terms of the cylindrical 

35 coordinate system. Similariy, combiner 30 is arranged to generate the sound-source rendering positions in terms of 
cylindrical coordinates. The spatialisation processor must therefore either be arranged to understand this coordinate 
system or the rendering positions must be converted to a coordinate system understood by the spatialisation processor 
" 10 before they are passed to the processor. This latter approach is preferred and thus, in the present case, assuming 
that the spatialisation processor is arranged to operate in terms of the spherical coordinate system illustrated in Figure 

40 2, a converter 66 (see Figure 9) is provided upstream of memory 15 to convert the rendering positions from cylindrical 
coordinates to spherical coordinates. 

[0054] Whilst it would be possible to use a single coordinate system throughout the apparatus regardless of the fomri 
of audio field to be produced (for example, the positions of the sound sources in the cylindrical audio field could be 
specified in spherical coordinates), this complicates the processing because with an appropriately chosen coordinate 

45 system most operations are simple additions or subtractions applied independently to the Individual coordinates values 
of the sound sources; in contrast, if, for example, a spherical coordinate system is used to specify the positions in a 
cylindrical field, then commanded changes in the field height (discussed further below) can no longer simply be added/ 
subtracted to the sound source positions to derive their rendering heights but instead involve more complex processing 
affecting both elevation angle and range. Indeed, by appropriate choice of coordinate system for different forms of 

50 audio field, equivalent operations with respect to the fields translate to the same operations (generally add/subtract) 
on the coordinate values being used so that the operation of the elements 25, 26, 29 and 30 of the apparatus is 
unchanged. In this case, adapting the apparatus to a change in audio-field form, simple requires the block 23 to use 
an appropriate coordinate system and for converter 66 to be set to convert from that coordinate system to that used 
by the spatialisation processor 10. 

55 [0055] With respect to adaptation of the Figure 1 apparatus to provide the required capability of commanding changes 
in height for the cylindrical audio field systems illustrated in Figures 4-6, such height changes correspond to the com- 
manding of changes In the elevation angle already described for the case of a spherical audio field. Thus, a height 
change command is supplied to the block 26 to set a field height value (an axial offset between the field reference 
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source to derive the elevation value for the rondcnn P°s «o^^^^^^^^^ 9 ,,pi,,, a suitable variation 

[00561 AS regards ho^ the focus zone and leakage 
oftheFigurelapparatusforprovidingthesefeaturesJnpart^^^^^^^^ 

between the output of combiner 30 the^^^^^^^^^^^ fac^sou Scu^e to condition how the sound source is to be 
or modifying one or more parameters associated ^^^^ 10 apparatus, the block 70 can 

presented in the audio field. As will be seen JJl^";;;^^^^^ source and/or set various sounding 

Include a range of different type of units that may '"^^ V ''l^.^^^^//'^^;^^^^^^^^ ,i,ter 71 that sets a audibility 
effect parameters for the source. In the present case he ,alue is passed to memory 1 5 for 

(volume level) sounding-effect parameterfor «««='^^^°""'^Xn t,e s^^^^^^^^ comes to render the sound 

Storage along with the source ID and P^^^^^ 1 5, it passes the audibility 

source audio label according to the position and audibility ^^^^^^^^^ ^^^^ ^^se, sets its volume level), 
value to a sounding effector 74 that conditions the -'^j' ^er fK^ttspo,^^^^^^ he current field height value 
[00571 in the case of the Figure 7 arrangement, 1^/ ^J^J.^^^ ^J^l ^h^^^^^^^^^ parameter value of each sound 

whisperzones. „„^^«„t th*. cylinder filter 71 performs a similar function except that now there 

s^sx:r.s°'r,:s=;?.r^"— ^^^^ 

creates these sources by: 

to block 23 as for any other semce; ^.,^^^rv audio labels each set being stored in memory 14 

floor, depending on the set concerned. 

H ,« .hP hiock 23 are there associated with null position data before being passed on via memory 
The source IDs passed to the block 23 ^-^^ there assoc recognises the source IDs as upper and 

25 and combiner 30 to arrive at the cylinder filter 71 of ° .^^^ " '^^^^ as setting the audibility parameter 

lower advisory sound source IDS and appropnate^ set^^^^^^^ 

srsSrarpre^^^^^^^^^ 

ST^^li .e appreciated that partially or ful. mutin, sound ^e^^^^^^^^^^^ 

Where the apparatus is set to ^^^-/^^P^.t"^,^^"^ "^^^ beyond a specked angular 

but now the cylinder filter 71 is replaced by a "spherical filter '"9 f " .^^ ^„ j^e presentation reference 

distance from a current facing direction of the ^^''^^^ ^^"^""^^'"^ nted that in the cale where the audio output 
vector is derived by block 26 and « ^lied to the fitter 71_ It m^ be n^^^^^^^^^^ 

devices 1 1 are constituted by headphones, ^^^t*;'*''''"^" rendering positions that are more than a given 

Multi ple Audio Sub-Fields 

[0060] Figure 10 shows a second apparatus ,or producing an «f <> "^'^^^^^fo?^ 

This apparatus is similar to the Figure 9 variant of the fi^ app^^^^^^ ^^^jra rpr^ntaUon. Elements of the 
has a variety of sound-source parameter """^'^^l^^'l^ reference numerals and their 

SSc^TwrnS^^^^^^^ 

accommodate features of the second apparatus. oroducina (part) spherical or part (cylindrical) 

by block 23. rr^nitinif* "ciub-fields" Each sub-field may be considered 
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changing the offset between the presentation reference vector and an audio-field reference specific to the sub-field. 
Further, each sub-field can have a different stabilization set for it - thus, for example, sound sources representing 
genera! services can be assigned to a head-stabilised sub-field whilst sound sources representing augmented-reality 
services can be assigned to a world-stabilised sub-field. The rotation/displacement of each sub-field and the setting 

5 of Its stabilization is done by block 26 with the resultant values being stored in nnemory 29. Whether or not the block 
26 modifies the azimuth-angle value of a sub-field to reflect a sensed rotation of the user's head will thus depend on 
the stabilization set for the sub-field and, as already described, on whether the audio output devices are head-mounted, 
body-mounted, vehicle-mounted or fixed with respect to the world (or, In other words, whether the presentation refer- 
ence vector is head, body, vehicle or world stabilised). To add flexibility to the Figure 10 apparatus, the current stabi- 

10 lisation of the presentation reference vector is fed to the block (see arrow) to enable the latter to make any appropriate 
changes to the sub-field orientations as the user turns (and/or nods) their head. 

[0063] Each service sound source is assigned by block 23 to a particular sub-field and an identifier of its assigned 
sub-field is stored with the source ID in memory 25 along with the position of the sound source relative to the audio- 
field reference associated with the assigned sub-field. The combiner 30 Is supplied from memory 29 with the rotation/ 
15 displacement values of each sub-field and for each service sound source combines the values of the related sub-field 
with the sound-source coordinate values; as a result, each sound source is imparted the rotations/displacements ex- 
perienced by its sub-field. For each service sound source, the output of the combiner comprises source ID, position 
data, and sub-field identifier. 

[0064] As will be seen below, assigning sound sources to different sub-fields may be done for reasons other than 
20 giving them different stabilizations; for example, it may be done to identify a group of service sound sources that are 
to be subject to a particular source-parameter modification process in block 70. 

[0065] It should also be noted that different sub-fields may have different dimensions and even different forms so 
that one sub-field could be a 2D spherical surface whilst another sub-field could be of 3D cylindrical form. 

25 Facilitating Clear Presentation 

[0066] As well as the cylindrical filter 71 . the source parameter set/modify block 70 includes a number of sound- 
source parameter conditioning units 80 to 85 for facilitating a clear audio presentation. The function of each of these 
units will be described more fully below. It is to be understood that the units need not all be present or operational 

30 together and various combinations of one or more units being concurrently active are possible; however, not all com- 
binations are appropriate but this is a matter easily judged and will not be exhaustively detailed below. Also, certain 
units may need to effect their processing before others (for example, units that affect the final rendering position of a 
sound source need to effect their processing before units that set sounding effect parameters in dependence on the 
final rendering position of a sound source); again, it will generally be apparent when such ordering issues are present 

35 and what ordering of the units is required to resolve such issues and an exhaustive treatment of these matters will not 
be given below. 

[0067] Unit 80 is a focus expander that serves to modify the rendering positions of the sound sources to spread out 
the sound sources (that is, expand or dilate the audio field) Jn azimuth in the region of the current direction of facing 
of the user (or other appropriate direction) in order to facilitate discrimination between sound sources. Referring to 

40 Figure 11 , this shows a field of 180° extent in azimuth with the user currently facing in the direction of the audio-field 
reference vector 90. The focus expander 80 operates to lineariy expand the 15*' segments 92 on both sides of the 
facing direction 91 into respective 45** segments 93 (see the hatched zones). The remaining segments are correspond- 
ingly compressed to maintain an overall 180*" azimuth range • in this case, this results in two 75° segments 94 being 
compressed into respective 45** segments 95; as an alternative (not illustrated), the remaining segments could simply 

45 be angularly displaced from their normal positions without compressing them. 

[0068] For sub-fields that are head-stabilised, turning of the user's head does not change the 1 5** segments subject 
to expansion; however, azimuth rotation of such a sub-field does result in the expansion being applied to different 
segments of the sub-fieid. 

[0069] For sub-fields that are not head-stabilised, as the user turns their head, the segments subject to expansion 
so change. This is illustrated in Figure 12 where a user has turned to the right 75° relative to the audio-field reference 
vector of a body-stabilised audio sub-field with an initial ± 90° range either side of the reference vector. This results in 
the most clockwise 30° of the original field (segments 92) being expanded (symmetrically with respect to the facing 
direction) so that now the audio sub-field extends round further in the clockwise direction than before. The remaining 
150° segment 97 of the original audio sub-field is expanded into a 90° segment 98. 
55 [0070] In order for the focus expander 80 to effect the required processing of the azimuth rendering positions of the 
sound sources, It is supplied (input 78 to block 70) with the angle of the facing direction relative to the current presen- 
tation reference vector, this angle being detenmined by the block 26 in dependence on the current stabilization of the 
presentation reference vector and the sensed head rotation. Of course, where the presentation reference vector is 
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the rendering positions ot the other sound sources ^^^^^l^^^^ [^^^l^^ .^r 80 on and o« as desired, it is 
[00711 it wlli be appreciated that the user f " ''^J^^'^'^f 'V^",' more selected sub-fields ralherlhan to all 
also possible to arrange torthe tocus expander o be aPPl'^d only to ^° ^ on azimuth 

tields indiscrlrr^lnately. Furthermore, whilst the '^^^ ^'^P^^^^^^^^ values (whether angles or 

reference direction' ). such as a spec.hc ^^^t^^J^'" ^ enable it to detemiine the relative offset 

focus reference direction is set to be the P'«««"*^"°" whether for controlling the focus expander 

10073] Arrow 79 in Figure 10 genera^^^^^ 

raibXrrso=:=^ 

all sub-ftelds IS preferably set In the ™er 81 ^ ^se'™ soun^s^^^^^^^^^ either direction) be^«een un-muted 
segment muting filter changes the aud b Irty "^9^.^"' edibility (sound volume) parameter of the sound 

and at least partially muted by «PP;°P-»f ^ ^^^^"9 the ° .f^^^^^^^'^J^^ „, ,,,.o sub-field, of 180" azimuth 

dt^d into five segments relative to the audio-field reference vector, namely. 

. an "ahead ' segment 1 01 extending in azimuth '^"'^f «: '° "^O*! 

- a "left" segment 102 extending in azimuth fronr. -30 to -60 

. a 'larler segment 103 extending in azimuth from -60 to -90 . 

- a "right" segment 1 04 extending in azimuth from -»-30 to +60 , 

. a "far righr segment 1 05 extending in azimuth from +60° to +90 . 

Thefi.terB1actstochangetheaud..l^parameterofea^^^^^^^^^^ 
, and 0% (or a preset low level) in ^««P°"f »° "^J".^^^^^ these'sounds sources as muted by 

sources in segment 1 02. the user need only ^^^V Jl^J^^^J^^^^s to volume, the user says "Un-Mute Left". As 
showing them in dashed outline). To bnng ''^^ >^ le specked by the audibility parameter is 

rerryiordi;r£^^ 

XTts to be Lpplled with the sound label for t^e soun^^^^^^^^^^^ 
[0S75I Preferably.thesegmentscanbemutedandun-rnu^^^^^ 

lor only one segment to be muted at « ^-^^^^^^^^^^^^^ wUh on ly o^e segment'belng un-muted at a time, the un- 
previously muted segment; the opposite is also ^''''^ J^"^^ ^^ ,3 ^i^o possible to arrange for several 

muting of a segment causing any previously un-muted s«g'"®""°^!,3^'': '1 'tl^^L both the "left" and "far lett" 
Segments to be muted simultaneous^ in -«P»"«Vn bVrlTutedr^P^^^^^^^^ a user Command of "Mute All Left", 
segments 102. 103 in Figure 13 could be ^^^^^^^^^^^ ^ZeluX relative to the audio-field reference 
[0076] The segments are pre-specif.ed in temis 0 ^^^' .^'Xr^ e^^ere^ In order for the segment muting filter 
vecto4 by segmentation data stored In the ^^^-^^^^ "^"^^f 'l.'J'^J f,"^^^^^ know the current azimuth angle 

to mute the sound sou«=es corresponding to a segm^^^^^^^^^^^ 
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can be specified relative to the facing direction of the user (which may, in fact, be more natural). In this case, the 
segnnent muting filter needs to know the angle between the current facing direction and the presentation reference 
vector; as already described, this angle Is provided on input 78 to block 70. A further alternative is to pre-speclty the 
segments relative to the presentation reference vector (which, of course, for headphones is the same as specifying 
5 the segments relative to the user's facing direction). 

[0078] The specification data that specifies the segments is preferably, though not necessarily, set in by a user, or 
is at least modifiable by a user; to this end. suitable user input means (not shown) are provided for specifying/changing 
the specification data. Default specification data can be factory set. 

[0079] Whilst segment muting has been described using segmentation in azimuth, it will be appreciated that the 

10 segmentation can be effected in any appropriate manner (for example, in azimuth and elevation in combination) and 
the term 'segment' Is herein used without any connotation regarding the form or shape encompassed. 
[0080] Rather than a segment remaining muted until commanded to return to its un>muted state, a muted segment 
can be arranged only to stay muted for a limited period and then to automatically revert to being un-muted. 
[0081] Unit 82 is a cyclic muting filter. As depicted in Figure 14 (whteh uses the same field development as Figure 

IS 13), this filter 82 works on the basis that the sound sources 40 are divided Into groups 110 to 114 and the filter 82 
operates cyclteally to change the audibility state of the sound sources so as to at least partially mute out all but one 
group of sources in turn - in Figure 14, all groups except group 111 are currently muted. The un-muted group remains 
un-muted, tor example, for 10 seconds before being muted (partially or fully) again. As with the segment muting filter, 
the filter 82 operates by setting the value of an audibility parameter of each sound source. Rather than requiring a 

20 group ID to be assigned to each sound source and transferred along with the sound-source ID. position data, and sub- 
field identifier to the block 70, grouping can be achieved by assigning a separate sub-field for each group. 
[0082] The grouping of sound sources can be effected automatically by service type (or more generally, one or more 
characteristics associated with the item represented by the sound source concerned). Alternatively, the grouping of 
the sound sources can be effected automatically according to their positions in the audio field (possibly taking account 

25 their relation to the presentation reference vector, the audio field reference vectors, or user direction of facing). A further 
possibility is for the grouping to be user specified (via block 23). In one possible grouping arrangement, each sound 
source is assigned to a respective group resulting in each sound source being un-muted in turn. Preferably, the user 
can also specify that one or more groups are not subject to cyclic muting. Additionally, the user can be given the option 
of setting the un-muted duration for each group. 

30 [0083] As already indicated, muted groups need not be fully muted. Where the sound sources are assigned to groups 
according to their positions, a possible muting pattem would be to fully mute sound sources in groups lying either side 
of the currently un-muted group of sources, and to partially mute the sound sources of all other groups. 
[0084] Rather than the un-muting and muting of the groups being effected in an abrupt manner, the group whose 
limited period of being un-muted is ending can be cross-faded with the group whose period of being un-muted is next 

35 to occur. 

[0085] Unit 82 is a collection collapser the basic purpose of which is to respond to a predetermined user command 
to collapse all sound sources that are members of a specified collection of sound sources to a single collection-repre- 
senting sound source at a particular location (which can be head, body, vehicle or worid stabilised). The member sound 
sources of the collection can be Identified by a specific tag associated with each sound source ID; however, It Is con- 
40 venlent to assign all sound sources to be collapsed to the same sub-field and simply rely on the sub-field ID to identify 
these sources to the block 70. 

[0086] Figure 15 illustrates the general effect of the collection collapser 82 for a situation where all augmented-reality 
sound sources 40[AR] are members of the same collection and have been assigned to the same worid-stabilised sub> 
field; these augmented-reality sound sources are arranged to be collapsed to a single collection-representing sound 

45 source 1 20 positioned at the top center of the audio sub-field. Other positions for the source 1 20 are, of course, possible 
such as In line with the current direction of facing or the location of a particular one of the sound sources being collapsed. 
[0087] The collection collapser is further arranged to reverse the collapsing upon receipt of a suitable user command. 
The collection-representing sound source 120 will generally not be present when the member sound sources of the 
collection are un-collapsed though it is possible to leave the collection-representing sound source un-muted to serve, 

50 for example, as notification channel to infonm the user of events relevant to the collection as a whole. 

[0088] In a typical implementation, the collection-representing sound source is created by the subsystem 13 and is 
given an ID that indicates Its special role; this sound source is then assigned to the same sub-field as the collection 
member sound sources to be collapsed. The collection-representing sound source Is also given its own audio label 
stored in memory 14 with this label being arranged to be temporarily substituted for by any notifications generated in 

55 relation to the collection member sound sources (each sound source is also arranged to have its normal label tempo- 
rarily replaced by any notification related to that source). Whilst the collection member sound sources are not collapsed, 
the audibility parameters of these sound sources remain at 1 00% but the collection-representing sound source has its 
audibility parameter set to 0% by the collection collapser. However, when the collection collapser 83 Is triggered to 
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either by location or In some other manner) occurrina when the collection Is being collapsed and un- 

[0090] To provide clear feedback to the use as to what s ^^^^^^^fJJ^ between its two states non-lnstan- 
collapsed. the collection collapser Is preferably ^'-^"f^^ '^^^^^^^^^^ fo exlple. during collapse, the collectlon- 
taneously and with the accorr^paniment of ^PP^P"!*^. ^^^J^^^' /^^^^ sources are faded out. This can be ac 
representing sound source can be faded up «^ ♦^^^.'^''f'^i^^^^ sound sources are notionaliy being 

companied by a sound such as a sucking .n ^^"^^"^^J^'^^,^ .ocations of the member sound sources can 

. -re^'r:ror^Ttrtr^::i'^ 

f^rurar:ci^^^^^^^ 

?rrAs..ardsthenon-collectionsounds^^^^^^^^^^^^^^^ 

. Changes in the state of the collection. " sound sources. For example, upon un- 

,0 be modified to adapt to the presence or ^''^^^^'I'^^^'^Zo^Uer^ a member sound source appears m the 

collapsing of the collection, the location of ^'^l^^^^l^'^^^ stund so^^^^^^ 
audiofieldcanbechangedtoensuream.n,mumsepa^auonj^^^ 

of the collection the other sound sources can be P^'^^^'^^^l^^^^i^^ t^an just a way of opening an audio menu 
„ 10093] it will be appreciated that the collection ^P^.fl P"*^''*^^J/'„73,^"^ of the collection member 

Ihere the member sound sources represent '--^^^l^'^^i^^l^^^^,^^^^ 

. [oTJ^rUnltMisasub-fleldsoundsenerlntende.^^^^ 

of a particular sub-field or sub-fields. The sound se"^' °perat^e to P corresponding sound 

either on or off for each sound source. «hHst the sound.n^^^^^^^ TdeSt.whenlhe sound setter Is enabled 

effect to ail sound sources for which the parameter isset to on^PreU^aoiy.a ^^^wever, the user can de- 

rhetound sources of all sub-fie.ds have the -'^^^^sS*^;"^^^^^^ 
.0 selectoneormoresub-fieldsforthisueatmen^^^^^^ 

each associated with a different sound ^«^^*^ J^P ^f' ^"""J^J^^ ^oise or other distinctive sounds, etc. 
thesised sounds. ^ u « „io «nnmnrlaie inout means, what sound effect is to be used to 
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sound effect. . arouo of sounds and another (such as real and synthesised 

[0097] in fact, another way of distinguishing ben^een one f °"P f ^ne of the group of sound sources 

Lnds) is by way of specify ing a particular stabil^^" '^^^^^^^^^^ 

to be distinguished. Thus, audio labels "dttaSs^^^^^ th^t they move relative to the real world as the 

assigning the audio-label sound sources j^^^^^^^^^^^ ''J^'J" services could be assigned to a head-stab.Used 
user turns their head. As another «««"^P'^; J ^^^^J* ^^r^^^ sub-field. As a refinement to always 

^^n^ttsr^birri^^^^^^^^^^^ 
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associated with the presentation reference vector between updates). 

[0098] Unit 85 is a range sound setter and is applicable only where an audio sub-field has depth (that is, the range 
paranneler can be different for different sound sources of the sub-field). The range sound setter, when enabled in 
respect of a sub-field, is operative, for each sound-source In the sub-field, to set a sound source parameter according 

5 to the range of the sound source. The purpose of doing this Is to Impart an audible characteristic to the sound source 
that indicates to the user at least a general range of the sound source. This parameter could, for example, be the 
audibility parameter with the value of this parameter being set such that sound sources at a greater range are presented 
at a lower volume. However, In a preferred embodiment, the value of the parameter controlled by unit 85 is used to 
select which audio label to render from a set of audio labels associated with a sound source, each label having a 

10 different presentation character at least one aspect of which, other than or additional to loudness, differs between 
labels. This aspect Is. for example, speaking style, vocabulary, speaker voice, etc. The mere change in a range value 
included in an announcement is not considered to be a change in the presentation character of the announcement. 
[0099] The user can readily learn to associate the differing presentation characters with particular range bands. 
Figure 16 illustrates an example conceming a sound source for an augmented-reallty notification service from the 

15 user's local newspaper shop; this service sound source has three associated audio labels, stored for it in memory 14, 
of increasing familiarity the closer the sound source is to the user: 





Audio label 


Range 
extent 


>Z2 


**Excuse me Sir, would you like your newspaper?" 


Zl - Z2 


"Hello Mr Smith, yournewspaper" 


0-Zl 


Hi, John. Paper!" 



The unit 85 sets a label-selection parameter for the sound source according to its range and the relevant label is then 
used by the spatialisation processor 1 0. Assuming that the newspaper notification service has indicated the real-world 
30 location of the newspaper shop to the apparatus, the processing block 22 can continuously update the position of the 
notification-service sound source in the audio field to reflect the movement of the user in the vicinity of the shop. As a 
result, the notification audio label will change as the user approaches the shop (or moves further away). Preferably, of 
course, the notification-service sound source is assigned to a world-stabilized sub-field with the position of the service 
sound source being set to be in the same direction for the user as the shop itself. 
35 [0100] In a variant of the arrangement described above, rather than the sound sources presenting audio labels for 
services that have associated real-world locations, the sound sources can be arranged to present audio labels for real 
world entitles with real-world locations, the range of the sound sources in the audio field being typically, though not 
necessarily, set to represent the actual distance between the user and the real-world location of the entity concerned. 
Indeed, the concept of using announcements each of a different character to indicate distance between the user and 
40 . a sound source can be applied whatever entity, real or virtual, is being represented by the sound source; in this context 
the term "virtual entity" means any non-real-world entity such as a service, a data item, or application. 
[01 01 ] The concept of using announcements each of a different character to indicate distance can be further applied 
to situations beyond the current context of a spatialised audio field. For example, user-carried equipment can simply 
be arranged to make a succession of non-spatialised audio announcements, each with a differing presentation char- 
ts acter, as the user approaches a particular real-world location or a device in relation to which range measurements can 
be made in any suitable manner. 

[0102] Figure 17 shows a further example beyond the context of a spatialised audio field. In this example, a fixed 
device 125 with speech output capability is anranged to sense the approach of a person 126. As the person 1 26 moves 
closer to the device 125 (the user's movement track is represented by dashed line 127 in Figure 1 7), the range of the 

so user from the device crosses range trigger values Z6, Z5 and Z4 (in decreasing range order) triggering a respective 
audio announcement having a range-dependent character. As with the Figure 16 arrangement, the formality of each 
announcement decreases with distance (this merely being illustrative of one way in which range changes can be 
indicated to the person 126). The sensing of the distance between person 126 and device 125 can be done in any 
suitable manner such as by using fixed sensors, round-trip time measurements for signals sent from the device and 

55 returned by equipment carried by person 126 (with known internal processing delay), by a local radio location system 
interacting with equipment carried by person 126, etc. - in general terms, range determination is done by range-deter- 
mining equipment at one of the entity, the user, and generally In the environment, either alone or in coopeijation with 
auxiliary range-determining equipment at another of the entity, the user, and generally in the environment. 
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operable device or a voice-input device set to recognise appropriate commands. For a 2D spherical field, the apparatus 
is arranged to pemrilt control of both the azimuth angle and elevation angle of the audio-field reference vector relative 
to the presentation reference vector; for a 2D cylindrical field, the apparatus is set to permit control both of the azimuth 
angle of the field and of its height (elevation). This permits any point (and thus any sound source) In the field to be 

5 brought into line with the predetenmlned selection direction by rotations/displacement commanded by input device 1 36. 
[0110] A selection-direction comparison unit 137 of the source parameter set/modify block 70 is fed with an input 
138 from block 26 indicating the angular offset between the selection direction and the presentation reference direction 
(this offset is readily detemiined by block 26 from the inputs it receives). Given this infomnatlon, unit 137 detemilnes 
If any sound source in the audio field lies In the selection direction (or within a defined angular distance of It) and, if 

to so, sets a selection parameter of that sound source to tme', resetting the parameter to 'false' upon the sound source 
ceasing to be in alignment with the selection direction. The unit 1 37 operates on basis of the rendering position of each 
sound source after any processing by other units of block 70 that may affect the rendering position of that sound source. 
The unit 137 may also set a sounding effect parameter for the sound source to give a distinctive sound for that source 
in order to indicate to the user when a sound source lies in the selection direction. 

15 [01 1 1] The input device 1 36 as welt as enabling the user to rotate/displace the audio field, also enables the user to 
indicate that a sound source lying in the selection direction is to be selected. This Indication is generated, for example, 
using a selection button or upon recognition of a command word such as 'select', and results in a corresponding signal 
being fed on line 139 to a mode and source control block 128 of the output selection block 12. On receiving this signal, 
block 128 accesses the memory 15 to determine which sound source, if any. currently has its selection parameter set 

20 to 'true'; provided such a source is identified, the block 1 28 switches the apparatus from its desktop mode to its service 
mode and instructs the spatiaiisation processor 10 on line 129 to output a full service feed for the identified servbe 
sound source. 

[0112] It may be noted that when the apparatus is in Its desktop mode, at any given moment some of the sound 

sources may be in a fully muted state due to operation of units of the source parameter set/modify block 70. Since it 
25 is unlikely that a user will intentionally be trying to select such a muted source, when the mode and source control 
block 128 accesses memory 15 to Identify a sound source lying in the selection direction, it is preferably arranged to 
ignore any muted sound source, notwithstanding that the source lies in the selection direction. 

[0113] The fact that the Figure 1 0 permits the presence of multiple sub-fields has two consequences for the above- 
described selection technique. Firstly, it will generally be desirable for the Input device 1 36 to be able to rotate/displace 

30 any desired one of the sub-fields independently of the others; however, when the user wishes to move a sound source 
to lie in the selection direction, it is simplest to arrange for all sub-fields to be moved together by devk;e 136. Secondly, 
with multiple sub fields that are independently movable, it is possible that multiple sound sources can lie in the selection 
direction at the same time; in order to cope with this, block 128 can operate any suitable prioritisation scheme to choose 
between such sound sources or can present the choice of sources to the user to allow the user to select the desired 

35 one of the sources lying in the selection direction. 

[0114] With regard to the selection direction comparator unit 137 setting a sounding effect parameter to give an 
audible indication to the user when a sound source lies in the selection direction, the operation of unit 137 can be 
refined also to adjust a sounding effect parameter to indicate when a sound source is near the selection direction, the 
adjustment to the sound effect being such as to provide an indication of the direction In which the sound source needs 

40 to be moved to come into alignment with the selection direction. 

[01 1 5] The second selection technique to be described uses an audio cursor. This cursor is a special sound source 
that Is arranged to be rotated/displaced by a cursor control input device 140 which, like input device 136, can take any 
suitable form; indeed, devices 1 36 and 140 can be combined with a mode control for switching between the respective 
functions of the two devices. For the Figure 10 apparatus, one straight-fonward way of implementing the audio cursor 

45 is as a sound source aligned with the audio-field reference vector of a dedk;ated sub-field; in this case, the output of 
the cursor control input device is fed to block 26 to rotate/displace that sub-field (from which it can be readily seen that 
the function of input device 140 can easily be effected by input device 136). Preferably, the audio-cursor sub-field is 
arranged not to move with the other sub-fields and to be body stabilised. An alternative audio cursor implementation 
Is for the input device 140 to directly set the position of the audio-cursor sound source relative to the presentation 

50 reference vector, this being the implementation depicted in Figure 18 where a block 141 uses the output from device 
140 to calculate the current cursor position. With either Implementation, the current rendering position of the cursor is 
fed to the source parameter set/modify block 70 where it is stored in a memory 144. 

[0116] A cursor sound setter unit 145 of block 70 compares the position of the cursor against the final rendering 
position of each sound source (the unit 145, like the unit 137, is thus arranged to operate using the rendering position 
55 of each sound source after any processing by other units of block 70 that may affect the rendering position of that 
sound source). If no sound source is close to the cursor's current position, a cursor-sound parameter is set to a cor- 
responding value and is passed, along with the cursor ID and rendering position, via the converter 66 to memory 15. 
The spatiaiisation processor, in conjunction with sound effector 74, then causes a distinctive cursor sound to be gen- 
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For example, the audible indication can comprise a first, non-varying, element indicative of the general proximity of 
the cursor to an item-representing sound source, and a second, continuously variable, element indicating the separation 
distance between the cursor and the sound source. These elements can emanate from the cursor or the sound source 
concerned or divided between the two. 

[0126] Where the audio sub-fields are of 3D form, it is possible to arrange for the audio cursor to be moved In the 
third (range) dimension. This can most conveniently done where, as shown in Figure 1 8, the cursor-control Input device 
140 is used to directly set the cursor position relative to the presentation reference vector; in this case, the input device 
is simply further arranged to set the range of the audio cursor and this range value is stored in memory 144. In order 
to provide the user with an indication of the range of the audio cursor, the cursor sound setter unit 145 is preferably 
arranged to set the value of a sounding effect parameter of the cursor according to the current range of the cursor 
(regardless of the proximity of any sound source), the sounding effector 74 then producing a correspondingly modified 
sound for the cursor. For example, where the sounding effector produces a tone to represent the cursor, the volume 
of the tone can be adjusted, via an audibility parameter, to reflect the current range position of the cursor (the greater 
the range, the quieter the cursor sounds). Alternatively, the frequency of the cursor tone can be varied with the current 
range of the cursor. 

[0127] It may be noted that the focus expander 80 can conveniently be linked to the audio cursor to expand the 
region of the audio field about the cursor rather than about the current direction of facing of the user as was earlier 
described. In this case, the unit 80 is supplied with the current cursor position from memory 144 rather than with the 
current facing direction of the user. 

[0128] The third selection technique is based on the use of a speech recogniser 150 to determine when the user is 
speaking the sound label of a sound source, the speaking of such a label being taken to be an indteation that the user 
wishes to select the source. 

[0129] Speech recogniser 150 has speech input 151 and associated vocabularies that define the words between 
which the recogniser is to distinguish. In the present case, the vocabularies associated with the speech recogniser 
include a command vocabulary (stored in memory 152) holding command words such as "desktop" (to return to the 
desktop mode); "louder" and "softer" (to generally increase and decrease volume levels); "rotate left", "rotate right", 
"up", "down" (where sub-field rotation is to be effected by spoken command), numbers 1 to 10 (to identify sub-fields), 
etc. The audio labels held in memory 14 also define a vocabulary for the recogniser, the phonetic contents of the label 
words being made available to the recogniser through an appropriate reference database (not shown). In the event 
that a sound source has its associated label constituted by an audio feed from the source or by non-word sounds, then 
the label memory is preferably arranged to store appropriate words that the user might use to select the source, these 
words being advantageously supplied by the related service when first selected by subsystem 13. 
[0130] In order to facilitate the operation of the speech recogniser 150, various measures can be taken to the reduce 
the search space of the recogniser (that is, the range of words with which it tries to match a spoken word received via 
input 151 ). In the present case, three different restrictions are applied to the search space though it is to be understood 
that these restrictions can equally be applied in isolation of each other. These restrictions are: 

(i) A restriction to sound sources positioned within a range gate determined by the loudness of the spoken input 
(this restriction is only relevant where the audio sub-field(s) have depth - that is, a spread of range values). As- 
suming that the user knows the general range of the sound source the user wishes to select, then the user can 
speak the audio label of the source at a loudness volume reflecting the range of the source. Typically, the user will 
speak the label of a nearby source louder than that of a more distant one - the underlying model here is that the 
user is reflecting the fact that nearby sound sources are generally louder a the user than far away ones. However, 
it would also be possible to use the opposite scheme where the user speaks louder for further way sources - here 
the underiying model is that the user needs to speak louder in order for the remote source to 'hear*. The loudness 
of the speech Input is measured by block 154 and converted to a range gate. Figure 19 shows an example rela- 
tionship between loudness and range that can be used by block 154; in this case, for a received loudness of LI , 
a range gate G is determined corresponding to equal increments AL either side of LI . The derived range gate G 
is passed to a restrictions application block 155 that accesses memory 15 to determine which sound sources lie 
within this range gate. The recogniser search space is then restricted to the labels (or other identification words) 
associated with the sound sources within the range gate. To help the user speak a label at the correct loudness, 
it is possible to provide a calibration mode of operation (selected in any suitable manner) in which when a user 
speaks a word, that word (or another sound) Is rendered In the audio field at a range corresponding to that assessed 
by the loudness-to-range classifier 154; the Implementation of this feature is straight-fonward and will not be de- 
scribed in further detail 

(ii) A restriction to sound sources that are currently audible. This restriction is implemented by block 155 which 
accesses memory to detemnine whether the current value of the audibility parameter of each sound source is such 
as to permit It to be heard. The recogniser search space Is then restricted to the labels (or other identifteation 
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markings) is sufficient to uniquely determine the orientation of the trackball. This permits the trackball to be marked in 
a human visible manner to indicate to the user the current orientation of the trackball and thus the commanded rotation 
of the audio field - where no stabilisation offset is applied by block 26, this orientation directly corresponds to that of 
the audio field relative to the presentation reference vector (this would be the case, for example, where headphones 
5 are being used and the audio field is head-stabilised). By way of example, the eight quadrants of the trackball can each 
be given a respective colour with the aforesaid sensing pattern being marked out using infrared or magnetic Inks; 
Figure 20 depicts the applteation of different markings (such as colours) to different quadrants with three such quadrants 
166, 167, and 168 being visible, 

[0140] Directly marking the outside of the trackball to indicate orientation has a disadvantage in that if the trackball 
10 161 is allowed to be rotatable about all three axes 162-164, then rotations about all axes must be measured and 
corresponding rotations effected to the audio field - if this is not done, the markings on the trackball will quickly cease 
to correspond to the orientation of the audio field. Whilst it is possible to engineer restrictions on the rotation of the 
trackball so that it can only rotate about the two desired axes, an alternative and preferred approach Is to provide a 
visual orientation indicator arrangement that uses the sensed rotation of the trackball to determine the orientation to 
15 be indicated by the arrangement. Such an arrangement avoids the need to match the orientation of the trackball with 
that of the audio sphere and it is possible to use a conventional two-axis rotation sensing arrangement that simply 
measures angular changes (rather than absolute orientations) potentially with slippage. 

[0141] One suitable form of fixed visual orientation indicator an-angement is illustrated in Figure 21 that shows a 
trackball-based input device 1 70 similar to that of Figure 20 but without quadrant markings on the surface of its trackball 

20 171 ; instead, a row of indicator lights 173 (typically LEDs) is provided. Each LED 1 73 represents a respective quadrant 
of the audio field, the quadrant concerned being depicted, for example, by a graphic adjacent the LED. The activation 
of the LEDs is controlled to indicate the cun-ent commanded orientation of the audio field as known to block 26 of the 
Figure 1 0 apparatus. Thus, as a commanded rotation of the audio field brings the presentation reference vector within 
a quadrant of the audio field (assuming, for the moment, no stabilisation rotation of the audio field), the block causes 

25 the LED 1 73 con-esponding to that quadrant to be activated, all other LEDs being deactivated. 

[0142] Rather than arranging the LEDs 173 in a row, different coloured LEDs (or other light emitting devices) could 
be grouped together Inside the trackball itself, the latter being translucent or transparent so the user can see the colour 
of the currently activated LED and therefore gain an indication of the current orientation of the audio sphere. This latter 
configuration requires an appropriate arrangement for powering the LEDs inside the trackball and this can be achieved 

30 either by an arrangement of sliding contacts or by flexible wiring runs and physical limiters on the movement of the 
trackball to prevent excessive twisting of the wiring. In a further alternative embodiment of the indicator arrangement, 
the trackball surface is covered with a layer the visual properties of which can be altered by control signals; in this 
manner the visual appearance of the trackball provides the desired orientation indication. 

[0143] Rather than the visual orientation indicator arrangement indicating the orientation of the audio field relative 

35 to the presentation reference vector without regard to any stabilisation rotation of the audio field (that is, only indicating 
the commanded rotation), it is preferable to arrange for the indicator arrangement to indicate the audio-field orientation 
relative to a selected "indicator reference" direction (for example, the presentation reference vector, the current facing 
direction of the user, the forward-facing direction of the user, a world-fixed direction such as North, or a vehicle straight- 
ahead direction for in-vehicle audio systems) with account being taken, where required, of any rotation of the audio 

40 field effected to give it a specified stabilisation. The required output indication from the indicator arrangement is deter- 
mined, for example, by block 26 and may require information (rotation of the user's head relative to their body, rotation 
of the user's head relative to the worid or to a vehicle, rotation of the user's body relative to the worid or to a vehicle) 
not available from any sensors currently being used for achieving a specified audio-field stabilisation sensors - in such 
cases, the appropriate sensors will need to be provided to supply the required infonnation to the block 26. 

45 [0144] Basically, in order for the block 26 (or other processing means) to appropriately control the visual orientation 
indicator arrangement, it needs to know about any changes in the offset between the audio field reference and the 
presentation reference vector (either user commanded or required to achieve a particular stabilisation), as well as any 
changes in the orientation of the indicator reference direction relative to the presentation reference (caused, for exam- 
ple, by rotation of the user's head or body). In certain cases, at least components of the changes in the offset between 

50 the audio field reference and the presentation reference vector required to achieve a particular stabilisation in the 
presence of rotation of the user's head/body, will match the changes in orientation of the indicator reference relative 
to the presentation reference resulting from the rotation of the user's head/body. In such cases, it is only necessary to 
take account of the unmatched components (notably, but not in all cases exclusively, the user-commanded component) 
of the offset between the audio field reference and the presentation reference. In implementing block 26 (or other 

55 processing means) for detemnining the orientation between the audio-field reference and the indicator reference di- 
rection, it is not, of course, necessary first to determine the offset between the audio field reference and the presentation 
reference vector and the orientation of the indicator reference relative to the presentation reference, before going on 
to determine the orientation between the audio-field reference and the indicator reference direction; instead the various 
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be activation of the relevant LED. A set of LEDs can be provided for this purpose in device 160 of Figure 20. 
[0150] Another suitable form of fixed visual orientation indicator arrangement is illustrated in Figure 23 that shows 
a trackball'based input device 180 in which a small display panel 185 is mounted to show a depiction of that part of 
the audio field lying either side of the Indicator reference direction. This depiction preferably gives both an Indication 
5 of the portion of the audio field concerned (for example. In terms of field coordinate ranges, or a quadrant name), and 
an Indication of the sound sources in this portion of the audio field. The orientation of the audio field can be Indicated 
by other types of diagram or image displayed on display panel 185. 

[01 51 ] The Figure 23 input device also includes, as well as a trackball 1 81 , a set of LEDS for indicating, in the manner 
described above with reference to Figure 21 , when a new sound source or new notification is available. 

10 [0152] Figure 24 shows a form of input device 1 90 specifically adapted for use with cylindrical audio fields though 
also usable with other fields. The input device 190 comprises a cylinder 191 that can be moved by hand back and forth 
along a shaft 192 coaxial with cylinder 191 (see dashed arrow 193) as well as rotated (see dashed arrow 194) about 
the shaft. Both the position of the cylinder 191 along the shaft 192 and the angular position of the cylinder 191 about 
the shaft are measured by suitable sensor arrangements (for example, electro-optical sensors) and are respectively 

15 used to set the height and azimuth angle of the cylindrical field being controlled. The cylinder 191 canries an index 
marking 195 that cooperates with a fixed scale 196 to indicate the current height of the audio field. Further markings 
(not shown) on the cylinder can be used to indicate the current azimuth setting of the audio field. A set of LEDs 198 
(or other light output devices) can be used to indicate the presence of a new sound source or of a new notification, the 
LED 198 activated being dependent on the height of the sound source concerned ( the scale 196. or other markings, 

20 can be used to indicate the height significance of each LED). 

[0153] With the form of the input device 190 shown in Figure 24, because the azimuth orientation of the audio field 
is Indicated by markings carried by the cylinder 1 91 , only the offset between the audio-field reference and presentation 
reference can be indicated and this without any account being taken of rotation of the audio field to achieve a particular 
field stabilisation. To overcome these limitations, the input device 1 90 can be provided with any of the above-described 

2s f onns of visual orientation indicator arrangements controlled by block 26 to give the field orientation relative to a given 
Indicator reference direction. 

[0154] It will be appreciated that the above-described forms of visual orientation indicator arrangements controlled 
by biock 26 (or other processing means) to give the field orientation relative to a given indicator reference direction, 
can be implemented separately fronn the input devices themselves. Furthermore, the visual orientation indicator ar- 

30 rangements can still be employed where the user is not provided with means to change the offset between the audio 
field reference and the presentation reference (though , of course, there is little point in doing this in the above-mentioned 
cases where the user-commanded input was the only variable component of the orientation of the audio field reference 
relative to the indicator reference). Finally, it may be noted that the orientation of the audio-field reference relative to 
the indicator reference may have one, two or more degrees of freedom and the visual orientation indicator arrangement 

35 is therefore preferably correspondingly adapted to be able to indicate all degrees of orientation changes. By way of 
example, where a head-stabilised audio field is presented through headphones and the indicator reference direction 
is the current facing direction, then if only azimuth changes are involved for user-commanded rotations, for audio-field 
stabilisation and in detennlning the current orientation of the indicator reference relative to the audio field, then the 
orientation of the audio field relative to the indicator reference has only a single degree of freedom; however, if, for 

40 example, the user-commanded inputs can also change the elevation between the audio field reference and the pres- 
entation reference, then the orientation of the audio field relative to the indicator reference will have two degrees of 
freedom. The visual orientation indicator arrangement can, however, be restricted to indicate less than all of the degrees 
of freedom associated with the orientation of the audio field relative to the indicator reference. 

[01 55] Each of the input devices 1 60. 1 70, 1 80 and 1 90 also includes a selection button, respectively 1 65, 1 72, 1 82, 
45 and 1 97 for enabling the user to indicate that they wish to select a particular service either lying in the selection direction 
or overlaid with the audio cursor. Where sub field rotation/displacement (including rotation/displacement of a cursor 
sub-field) is to be controlled by any of the devices, then that device is preferably also provided with means for selecting 
which sub field is to be controlled; these means can take any suitable fonn such as selection buttons, a rotary selector 
switch, a touch screen selection display, etc. Similarly, selection means can be provided to switch between audio (sub-) 
so field control and cursor control where the cursor, instead of being associated with a sub-field, has its rendering position 
directly controlled by the input device. Further selection means can be provided to enable a user to select a particular 
indicator reference direction from a set of such directions which block 26 is set up to handle. 

[0156] The input devices described above are suitable for use with 2D audio fields. The devices are also suitable 
for 3D audio fields where the field/audio cursor is not required to be moved in the third (range) dimension. Where 
55 exploration in the third dimension is required (such as when an audio cursor is to be moved back and forth in the Z or 
range dimension), each device can be provided with a range slider generating an output signal in dependence on the 
position of a slider along a track. 
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can output an audio label giving information about what is at the map location and also give instructions as to whether 
the user needs to turn their head left or right to look directly in the direction of the map location. Another possible 
function would be to tell the user what is ahead in their current facing direction or current direction of travel. 
[0163] It will be appreciated that most of the functionality of the functional blocks of the various forms of apparatus 
5 described above, will typically be implemented in software for controlling one or more general-purpose or specialised 
processors according to modem programming techniques. Furthermore, whilst a number of separate memories have 
been illustrated the described embodiments, it will be appreciated that this is done to faciiitate a clear description of 
the operation of the apparatus; memory organisations and data structures different to those described above are, of 
course, possible. 

10 [0164] It should also be understood that the term "services" as used above has been used very broadly to cover any 
resource item that it may be useful to indicate to the user In much the same way as a PC visual desktop can be used 
to represent by visible Icons a wide variety of differing resource items including local software applications and individual 
documents as well as remote services. However, as illustrated by the above-described ghost services, the described 
forms of apparatus can also be used to present items that are not simply place-holders for underlying services but 

15 provide useful infonnation in their own right. 

[0165] It will be appreciated that many features of the above-described apparatus are capable of implementation 
independently of the present invention and In various implementations. The following paragraphs set out a number of 
these features in terms of their important elements, the features being expressed in method terms although it is to be 
understood that corresponding apparatus-type statements of the features are also possible. 

20 [0166] One features provides an audio user-interfacing method in which items are represented in an audio field by 
corresponding synthesized sound sources from where sounds related to the Items appear to emanate, the method 
comprising the steps of: 

(a) determining, for each said sound source, an associated rendering position at which the sound source is normally 
25 to be synthesized to sound In the audio field; 

(b) modifying said rendering positions of at least some of the sound sources to dilate a region of the audio field 
set relative to a focus reference; and 

(c) generating, using audio output devices, an audio field in which said sound sources are synthesized at their 
associated rendering positions after any modification effected by step(b). 

30 

[0167] Another feature provides an audio user-interfacing method in which items are represented in an audio field 
by corresponding synthesized sound sources from where sounds related to the items appear to emanate, the user 
being able also to hear real-world sounds from the environment; the method including the step of selectively applying, 
under user control, a distinctive presentation effect to the item-related sounds emanating from a group of at least one 
35 synthesised sound source whereby to assist the user in distinguishing these sounds from said real-world sounds. 

[0168] A further feature provides an audio user-interfacing method in which items are represented in an audio field 
by corresponding synthesized sound sources from where sounds related to the items appear to emanate, the method 
including the steps of: 

40 (a) associating at least some of the sound sources into a collection of which they are members; and 

(b) changing the collection in either direction between: 

an un-coltapsed state in which the member sound sources are present un-muted in the audio field; 
a collapsed state In which the member sound sources are muted and a collection-representing sound source 
45 provides an audible presence for the collection in the audio field. 

[0169] A further feature provides an audio user-Interfacing method in which Items are represented in an audio field 
by corresponding synthesized sound sources from where sounds related to the items appear to emanate, the method 
including the steps of: 

50 

(a) allocating the sound sources to groups of at least one sound source; and 

(b) automatically and cyclically changing the audibility of the sound sources such as to un-mute the sound sources 
of each group in turn for a limited period with the sound sources of the groups other than the un-muted group being 
at least partially muted. 

55 

[0170] A further feature provides a user-interface method in which items are represented to a user with respective 
perceivable range values, the items having respective associated labels by which they can be addressed, the method 
involving: 
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provide sounds related to the items concerned; 

(b) detenmining the relative location of a particular said sound source in the audio field with respect to a predeter- 
mined reference; 

(c) providing an audio indication of said relative location of said particular sound source, as determined in step (b), 
5 as a said sound emanating from that sound source. 

[0176] A further feature provides an audio user-Interfacing method in which items are represented in an audio field 
by respective synthesized sound sources from where sounds related to the items appear to emanate, the method 
comprising the steps of: 

10 

(a) setting the location of each sound source relative to an associated one of multiple audio-field references; 

(b) Independently controlling an offset between each audio-field reference and a presentation reference determined 
by a mounting configuration of audio output devices through which the sound sources are rendered in the audio 
field, 

15 (c) detemnlning a rendering position for each sound source based on its location set in step (a) and the offset of 

the associated audio-field reference; 

(d) rendering said sound sources at their associated rendering positions In the audio field. 

[0177] A further feature provides a user-interface method in which items are represented in an audio field by corre- 
ct) spending synthesized sound sources from where sounds related to the items appear to emanate, the method including 
the steps of: 

(a) varying an offset between an audio-field reference relative to which the sounds sources are located In the audio 
field, and a presentation reference determined by a mounting configuration of audio output devices through which 

25 the sound sources are synthesised; and 

(b) determining and visually indicating the orientation of the audio-field reference relative to a predetermined Indi- 
cator reference taking account, at least at a component level, of any change in value of said offset and any change 
in value of indicator-reference orientation relative to the presentation reference, at least where such changes do 
not match each other. 

30 
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1 . An audio user-interfacing method in which services are represented by audio labels presented in an audio field 
35 through respective synthesized sound sources, the method comprising the steps of: 

(a) storing, for each service to be represented, service access data and data associating the service with at 
least one said sound source and specifying at least one audio label; 

(b) generating an audio field In which said sound sources are synthesized at respective rendering positions 
40 to sound their associated sen^ice-representing audio labels; 

(c) selecting a service by identifying it through at least one of its sound source and audio label; 

the method further involving enabling a user to modify the audio-field layout of the service-representing sound 
sources and/or what services are represented In the audio field. 

45 

2. A method according to claim 1 , wherein step (c) involves rotating and/or displacing the audio field to bring the 
sound source of a target service to lie in a predetemilned selection direction. 

3. A method according to claim 1 , wherein step (c) involves moving an audio-cursor sound source In the audio field 
50 to align it with the sound source of a target service. 

4. A method according to claim 1 , wherein step (c) involves the user speaking the audio label of a target service, and 
using a speech recogniser to match, where possible, this spoken label to the stored audio labels. 

55 5. A method according to claim 1 , wherein the selection of a service in step (c) results In the service-representing 
sound sources being replaced by an audio interface to the service. 

6. A method according to claim 1 , wherein the access data of a said service is the path name on a local machine of 
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a service executable file. 

. .„.^.^P«»o.»™-'— ^ 

- application sollware; 

. a communication service; 

- an entertainment service; 

. a database-based information resource; 
. a file infomnation resource; 

- a transactional service; 

. an augmented-reality service. 

field with at least two degrees of freedom. 
, .™,..»o««»c« 

. u nninn aclivitv and provides notification of 
source associated with the service. ,,„.oan 

sources are synthesised. 

a user's head; 
- a user's body; 

. a vehicle in which the user is travelling; 
the world; 

■ • .7nrdaim18 wherein said Offsets varied in response to user ir^put via an input 
19. A method according to claim 17 or daim 1 8, wherein sa 
device 

are represented in the audio field. 
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audio field through respective synthesized sound sources, the apparatus comprising: 

a memory for storing, for each service to be represented, service access data and data associating the service 
with at least one said sound source and specifying at least one audio label; 

a generation subsystem for generating, through audio output devices, an audio field In which said sound sourc- 
es are synthesized at respective rendering positions to provide sounds their associated sen/ice-representing 
audio labels: 

a selection an^angement for selecting a represented service by identifying it through at least one of its sound 
source and audio label; and 

user input means for enabling a user to modify the audio-field layout of the service- representing sound sources 
and/or what services are represented in the audio field.. 

22. Apparatus according to claim 21 , wherein the selection arrangement comprises means for rotating and/or displac- 
ing the audio field to bring the sound source of a target service to lie In a predetermined selection direction, and 
means for Inputting a select command to select a service whose sound source is aligned with said selection di- 
rection. 

23. Apparatus according to claim 21 , wherein the selection arrangement comprises means for moving an audio-cursor 
sound source in the audio field to align it with the sound source of a target service, and means for inputting a select 
command to select a service with which the sound source is aligned. 

24. Apparatus according to claim 21 , wherein the selection arrangement comprises speech input means for recognising 
an audio label spoken by the user and selecting the corresponding service. 

25. Apparatus according to claim 21, further comprising means for responsive to the selection of a sen^lce by the 
selection arrangement to replace the audio field of service-representing sound sources with an audio interface to 
the selected service. 

26. Apparatus according to claim 21 , wherein the access data of a said service is the path name on a local machine 
of a service executable file. 

27. Apparatus according to claim 21 , wherein the access data of a said service is the address of a service resource 
on a remote machine to be accessed over a communications connection. 

28. Apparatus according to claim 21 , wherein said services comprise one or more of the following service types: 

application software; 

a communication service; 

an entertainment service; 

a database-based information resource; 

a file information resource; 

a transactional service; 

an augmented-reality service. 

29. Apparatus according to claim 21 , wherein the generation subsystem is such that the rendering positions of the 
sound sources are specified in the audio field with at least two degrees of freedom. 

30. Apparatus according to claim 21 , wherein at least one said audio label Is a verbal service name or descriptor 

31 . Apparatus according to claim 21 , wherein at least one said audio label is an audio feed from the service concerned. 

32. Apparatus according to claim 21 , wherein at least one said audio label is a distinctive sound or sound sequence. 

33. A method according to claim 21 . further comprising means for enabling a user to specify said audio labels. 

34. A method according to claim 21 , further comprising means for receiving audio-label specifying data from the cor- 
responding service and storing it in said memory. 
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